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2  Description  of  Research: 

Nonlinear  and  linear  systems  of  equations  often  arise  in  scientific  computation, 
for  example  in  implicit  methods  in  Computational  Fluid  Dynamics  (CFD).  It 
is  important  to  find  cost-effective  and  accurate  methods  to  solve  such  systems. 
Iterative  methods  are  among  those  widely  used,  especially  for  3D  problems. 
In  this  project,  we  consider  iterative  methods  which  are  especially  suited  to 
massively  parallel  architectures. 

To  accelerate  convergence  of  these  iterative  methods,  preconditioners  are 
often  used.  Good  preconditioners  reduce  the  number  of  iterations  and  involves 
few  arithmetic  operations  per  iteration.  Effective  parallel  preconditioners  must 
account  for  the  global  coupling  inherent  in  elliptic  problems.  On  the  other  hand, 
efficient  parallel  implementation  often  favors  local  computations.  Multiscale 
iterative  methods  represent  a  good  compromise  between  these  two  conflicting 
goals.  We  focused  our  attention  on  two  classes  of  multiscale  preconditioners: 
multilevel  basis  preconditioners  and  domain  decomposition  preconditioners. 
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Domain  decomposition  methods  are  techniques  which  solve  a  boundary  value 
problem  posed  on  a  general  domain  by  patching  together  solutions  on  small 
subdomains,  usually  in  an  iterative  fashion.  They  seem  ideally  suited  for  coarse 
grain  parallel  machines,  with  each  subdomain  assigned  to  a  single  processor. 
We  derived  efficient  methods  in  this  class,  for  overlapping  and  nonoverlapping 
subdomains,  for  convection-diffusion  problems,  for  unstructured  meshes  and  for 
partitioning  meshes. 

For  the  successful  application  of  preconditioned  iterative  methods,  in  addi¬ 
tion  to  efficient  preconditioners,  robust  and  efficient  convergence  acceleration 
and  stabilization  methods  are  also  needed.  We  investigated  ways  to  construct 
conjugate  gradient  like  methods  for  nonsymmetric  systems  which  possess  some 
of  the  following  desirable  properties:  short  recurrences,  transpose-free,  no  break¬ 
downs,  smooth  convergence  and  efficient  use  of  matrix-vector  multiplies.  We 
have  developed  a  simple  technique  called  composite  step ,  which  can  overcome 
one  kind  of  breakdowns  in  BCG  methods.  We  also  give  one  of  the  first  con¬ 
vergence  proofs  of  the  BCG  method  by  analyzing  as  a  Petrov- Galerkin  method 
defined  on  Krylov  subspaces  and  use  the  theory  of  Babuska  and  Azis.  Moreover, 
we  have  now  extended  this  composite  step  technique  to  stabilize  other  product 
methods  derived  from  BCG,  such  as  CGS,  BiCGSTAB  and  several  other  vari¬ 
ants. 

Finally,  we  also  studied  some  basic  linear  algebra  problems,  including  rank- 
revealing  QR  factorizations  and  stable  Toeplitz  solvers,  which  have  applications 
to  signal  and  image  processing. 


3  Summary  of  Most  Important  Results 


1.  We  extended  existing  domain  decomposition  and  multigrid  algorithms  for 
elliptic  problems  to  unstructured  grids.  The  key  issues  are  how  to  decom¬ 
pose  the  grid  into  subgrids,  and  how  to  construct  the  coarse  grid.  We 
proved  convergence  without  assuming  the  space  of  coarse  grid  functions 
are  nested  within  the  fine  grid  function  spaces  and  we  also  allow  the  coarse 
grid’s  boundary  to  not  match  that  of  the  fine  grid. 


2. 


3. 


We  proved  that  the  “Recursive  Spectral  Bisection”  method  for  partition¬ 
ing  unstructured  mesh  is  optimal  in  the  sense  that  it  computes  the  nearest 
discrete  feasible  solution  to  the  optimal  solution  of  a  continuous  form  of 
the  discrete  graph  optimization  problem  (which  is  computed  by  solving 
an  eigenvalue  problem).  Based  on  this  result,  we  developed  a  more  effi¬ 
cient  variant  of  the  algorithm  based  on  using  the  sign  of  the  eigenfunction 
instead  of  its  median. 

We  developed  ’’boundary  probing”  interfacial  preconditioners  in  domain 
decomposition,  which  adapts  to  the  coefficients  of  the  differential  operator 
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and  the  geometry  of  the  subdomains.  We  extended  this  to  derive  a  more 
efficient  version  of  the  vertex  space  method  of  Smith  and  Widlund. 

4.  We  have  developed  a  simple  technique  called  composite  step ,  which  can 
overcome  one  kind  of  breakdowns  in  BCG  methods.  We  also  give  one 
of  the  first  convergence  proofs  of  the  BCG  method  by  analyzing  as  a 
Petrov-Galerkin  method  defined  on  Krylov  subspaces  and  use  the  theory 
of  Babuska  and  Azis.  Moreover,  we  have  now  extended  this  composite  step 
technique  to  stabilize  other  product  methods  derived  from  BCG,  such  as 
CGS,  BiCGSTAB  and  several  other  variants. 

5.  Using  a  result  on  the  relationship  between  the  overlapping  and  nonover¬ 
lapping  methods  in  domain  decomposition,  we  proved  that  many  domain 
decomposition  algorithms  have  convergence  rates  that  are  independent  of 
the  aspect  ratios  of  the  subdomains  or  the  amount  of  overlap. 

6.  We  developed  a  block-ILU  factorization  technique  for  block-tridiagonal 
discretization  matrices  that  need  not  necessarily  be  M-matrices.  It  is 
based  on  combining  the  multigrid  idea  with  the  block  ILU  factorization. 
We  proved  the  existence  of  the  factorization  and  derived  estimates  for  the 
spectral  equivalence  relation  between  the  original  matrix  and  the  proposed 
preconditioner. 

7.  Developed  look-ahead  Levinson  algorithms  which  are  stable  for  a  much 
more  general  class  of  Toeplitz  systems,  including  indefinite  ones,  while 
retaining  the  efficiency  of  the  classical  algorithm.  Fortran  and  MATLAB 
codes  have  been  developed  also. 

8.  Developed  rank- revealing  QR  factorization  algorithms,  which  can  be  viewed 
as  inexpensive  alternatives  to  the  singular  value  decomposition.  We  de¬ 
veloped  applications  to  total  least  squares,  subset  selection,  rank-deficient 
least  squares. 
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